Haonan Lu

Learning from Fine-Grained Visual Discrepancies: Mitigating Multimodal Hallucinations via In-Context Visual Contrastive Optimization

May 29, 2026, via arXiv

MergeTok: Unified Continuous and Discrete Visual Tokenization via Token Merging

May 29, 2026, via arXiv

The MiniMax-M2 Series: Mini Activations Unleashing Max Real-World Intelligence

May 26, 2026, via arXiv

EXPO: Exploration-Prioritized Policy Optimization via Adaptive KL Regulation and Gaussian Curriculum Sampling

May 11, 2026, via arXiv

X-OmniClaw Technical Report: A Unified Mobile Agent for Multimodal Understanding and Interaction

May 07, 2026, via arXiv

PixelPrune: Pixel-Level Adaptive Visual Token Reduction via Predictive Coding

Apr 01, 2026, via arXiv

When Models Judge Themselves: Unsupervised Self-Evolution for Multimodal Reasoning

Mar 22, 2026, via arXiv

Click-to-Ask: An AI Live Streaming Assistant with Offline Copywriting and Online Interactive QA

Mar 19, 2026, via arXiv

Thinking in Streaming Video

Mar 13, 2026, via arXiv

Learning from Prompt Itself: The Hierarchical Attribution Prompt Optimization

Jan 06, 2026, via arXiv